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METHODS FOR IMPROVING 
TRANSFORMATION EFFICIENCY 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a Continuation-In-Part of Provisional application Serial 
No. 60/035,560 filed January 14, 1997. 

5 FIELD OF THE INVENTION 

The invention relates to the genetic modification of organisms, particularly 

plants. 

BACKGROUND OF THE INVENTION 

10 Gene transfer has offered great promise in the genetic manipulation of 

organisms. The movement of genes within plant species has played an important 
role in crop improvement for many decades. The recombinant DNA methods 
which have been developed have greatly extended the sources from which genetic 
information can be obtained for crop improvement. Gene transfer systems based on 

15 recombinant DNA are available for several crop species and are under development 

for many others. 

Rapid progress has been made in developing the tools for manipulating 
genetic information in plants. Plant genes are being cloned, genetic regulatory 
signals deciphered, and genes transferred from entirely unrelated organisms to 
20 confer new agriculturally useful traits to crop plants. Recombinant DNA methods 

significantly increase the gene pool available for crop improvement. 

A variety of methods have been developed for the transformation of plants 
and plant cells with DNA. Generally, the most success has been in dicotyledonous 
plants. Some success has been reported with certain monocotyledonous cereals. 
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Cereals comprise a commercially valuable group of plant species that could 
benefit from the introduction and expression of foreign genes controlling improved 
grain quality and such agronomically important traits as tolerance to disease, 
insects, herbicides, and stress. However, most cereals have not proven readily 
5 amenable to either Agrobacterium-mediated gene delivery, or to the routine 

regeneration of fertile transgenic plants from directly transformed protoplasts. The 
use of microprojectile-bombardment-mediated transformation of embryogenic tissue 
culture material, with the subsequent regeneration of transgenic plants, has 
overcome the regeneration problems associated with the production of plants from 

10 cereal protoplasts. Using this technology, transgenic plants have been obtained 

from microprojectile-bombarded tissue cultures of many species. 

Many of the recent advances in plant science have resulted from application 
of the analytical power of recombinant DNA technology coupled with plant 
transformation. These approaches facilitate studies of the effects of specific gene 

15 alterations and additions on plant development and physiology. They also make 

possible the direct manipulation of genes to bio-engineer improved plant varieties. 

While strides have been made in the genetic transformation of plants, it is by 
no means a routine matter. In fact, transformation efficiency is quite low making 
the process very labor intensive. Some reports indicate that the current 

20 transformation methods provide only a transformation frequency of about one event 

from every thousand bombarded embryos. This transformation frequency is too 
low for many genetic studies and for routine commerical applications. Therefore, a 
method is needed to improve the efficiency of genetic transformation. 

25 SUMMARY OF THE INVENTION 

Compositions and methods for improving transformation efficiency in 
organisms, particularly plants, are provided. The compositions, synthetic marker 
genes, are used in transformation methods and result in increased transformation 
efficiency. The synthetic marker genes can be designed for maximum expression in 
30 any system. 
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DESCRIPTION OF THE FIGURES 

Figure 1 : Nucleotide sequence of phosphinothricin acetyltransferase 
optimized for expression in monocots. 

5 Figure 2: Nucleotide sequence of cyanamide hydratase optimized for 

expression in monocots. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is drawn to compositions and methods for improving 
10 transformation efficiency in organisms, particularly plants. For use in plants, the 

method involves stably transforming a plant cell or culture and regenerating plants 
from the transformed cells. Using the methods of the invention, fertile transgenic 
plants can be grown to maturity with a high frequency. The fertile transformed 
plants are capable of producing transformed progeny that express foreign genes of 
15 interest. 

The methods of the present invention improve transformation efficiency. By 
improve efficiency it is intended that the number of transformed plants recovered by 
a transformation attempt is increased preferably at least two fold, preferably at least 
five fold, more preferably at least ten fold. 
20 The present invention thus encompasses the fertile transgenic plants and 

transformed seeds thereof, as well as the subsequent progeny and products derived 
therefrom. 

By transformation is intended the genetic manipulation of the plant, cell, cell 
line, callus, tissue, plant part, and the like. That is, such cell, cell line, callus, 
25 tissue, plant part, or plant which has been altered by the presence of recombinant 

DNA wherein said DNA is introduced into the genetic material within the cell, 
either chromosomally, or extra-chromosomally. Recombinant DNA includes 
foreign DNA, heterologous DNA, exogenous DNA, and chimeric DNA. 

The transformed plants of the invention can be produced by genetic 
30 engineering. Alternatively, transformed parent plants can be produced by genetic 
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engineering and used to transfer the foreign genes into subsequent generations by 
sexual or asexual reproduction. 

The methods of the present invention can be used in combination with any 
means for transformation of plants or plant cells. The present invention provides 
5 for the use of an optimized marker gene. The marker gene can be optimized for 

expression in a particular plant species, a particular genus of plants or a particular 
group of plants, for example monocots and/or dicots, maize, wheat, soybean, and 
the like. 

By marker gene is intended both selectable marker genes and reporter genes. 

10 Both selectable marker genes and reporter genes facilitate identification and 

selection of transformed cells. To date, all genetic transformation systems which 
have been developed rely upon a selectable marker or reporter gene to enable the 
recovery of transgenic plants. 

Reporter genes should ideally exhibit low background activity and should 

15 not have any detrimental effects on metabolism. The reporter gene products will 

have moderate stability in vivo, so that down-regulation of gene expression as well 
as gene activity can be detected. Finally, the reporter gene should be able to be 
assayed by a non-destructive, quantitative, sensitive, simple to perform and 
inexpensive system. 

20 Reporter genes are known in the art and include but are not limited to: 

Beta-glucuronidase (GUS) gene (Jefferson et al (1991) In Plant Molecular 

Biology Manual (Gelvin et a/., eds.), pp. 1-33, Kluwer Academic Publishers). This 

gene is encoded by the uidA locus of E. coli. GUS enzyme activity can be assayed 

easily and sensitively in plants. The expression of GUS gene fusions can be 
25 quantified by fluorometric assay, and histochemical analysis can be used to localize 

gene activity in transgenic tissues. 

Luciferase (DeWet et al (1987) Mol Cell Biol, 7:725-737). Luciferase 

catalyzes the oxidation of D(-)-luciferin in the presence of ATP to generate 

oxyluciferin and yellow-green light. 
30 Anthocyanins (Goff et al (1990) EMBO 9:2517-2522). Anthocyanin is a 

reporter system that does not require the application of external substrates for its 
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detection. The anthocyanin system utilizes the CI, B and R genes, which code for 
trans-acting factors that regulate the anthocyanin biosynthetic pathway in maize 
seeds. The introduction of these regulatory genes under the control of constitutive 
promoters includes cell-autonomous pigmentation in non-seed tissues. 
5 Green fluorescent protein (GFP) from the jellyfish Aequorea victoria (Kain 

et al (1995) BioTechniques, 79:650-655 and Chiu et al (1996) Current Biology, 
6:325-330). GFP emits bright green light when excited with UV or blue light. 
GFP fluorescence does not require a substrate or cofactor, is stable, and can be 
monitored non-invasively in living cells. 

10 Selectable marker genes are utilized for the selection of transformed cells or 

tissues. Selectable marker genes include genes encoding antibiotic resistance, such 
as those encoding neomycin phosphotransferase II (NEO) and hygromycin 
phosphotransferase (HPT) as well as genes conferring resistance to herbicidal 
compounds. Herbicide resistance genes generally code for a modified target protein 

15 insensitive to the herbicide or for an enzyme that degrades or detoxifies the 

herbicide in the plant before it can act. (See DeBlock et al (1987) EMBO J., 
6:2513-2518; DeBlock et al (1989) Plant Physiol, 97:691-704; Fromm et al 
(1990) 8:833-839; Gordon-Kamm et al (1990) 2:603-618) For example, resistance 
to glyphosate or sulfonylurea herbicides has been obtained by using genes coding 

20 for the mutant target enzymes, 5-enolpyruvylshikimate-3-phosphate synthase 

(EPSPS) and acetolactate synthase (ALS). Resistance to glufosinate ammonium, 
bromoxynil, and 2, 4-dichlorophenoxy acetate (2,4-D) have been obtained by using 
bacterial genes encoding phosphinothricin acetyltransferase, a nitrilase, or a 2,4- 
dichlorophenoxy acetate monooxygenase, which detoxify the respective herbicides. 

25 For purposes of the present invention, selectable marker genes include, but 

are not limited to genes encoding: neomycin phosphotransferase II (Fraley et al 
(1986) CRC Critical Reviews in Plant Science, 4:1-25); cyanamide hydratase 
(Maier-Greiner et al (1991) Proc. Natl Acad. Scl USA, 58:4250-4264); aspartate 
kinase; dihydrodipicolinate synthase (Perl et al (1993) Bio/Technology, 77:715- 

30 718); tryptophan decarboxylase (Goddijn et al (1993) Plant Mol Bio., 22:907- 

912); dihydrodipicolinate synthase and desensitized aspartade kinase (Perl et al 
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(1993) Bio/Technology, 77:715-718); bar gene (Toki et al. (1992) Plant Physiol, 
700:1503-1507 and Meagher et al (1996) and Crop Scl, 36:1367); tryptophan 
decarboxylase (Goddijn et al. (1993) Plant Mol. Biol., 22:907-912); neomycin 
phosphotransferase (NEO) (Southern et al. (1982) J. Mol. Appl. Gen., 7:327; 
5 hygromycin phosphotransferase (HPT or HYG) (Shimizu et al. (1986) Mol. Cell 

Biol, 6:1074); dihydrofolate reductase (DHFR) (Kwok et al (1986) PNAS USA 
4552); phosphinothricin acetyltransferase (DeBlock et al. (1987) EMBO J., 
6:2513); 2,2-dichloropropionic acid dehalogenase (Buchanan-Wollatron et al 
(1989) J. Cell Biochem. 13D.330); acetohydroxyacid synthase (Anderson et al, 

10 U.S. Patent No. 4,761,373; Haughn et al. (1988) Mol. Gen. Genet. 227:266); 5- 

enolpyruvyl-shikimate-phosphate synthase (aroA) (Comai et al. (1985) Nature 
317:141); haloarylnitrilase (Stalker etal, published PCT applcn WO87/04181); 
acetyl-coenzyme A carboxylase (Parker et al. (1990) Plant Physiol 92:1220); 
dihydropteroate synthase (sul I) (Guerineau et al. (1990) Plant Mol. Biol. 75:127); 

15 32 kD photosystem II polypeptide (psbA) (Hirschberg et al (1983) Science, 

222:1346); etc. 

Also included are genes encoding resistance to: chloramphenicol (Herrera- 

Estrella et al (1983) EMBO J., 2:987-992); methotrexate (Herrera-Estrella et al. 

(1983) Nature, 303:209-213; Meijertf a/. (1991) Plant Mol Bio., 76:807-820 
20 (1991); hygromycin (Waldron et al. (1985) Plant Mol. Biol, 5:103-108; Zhijian et 

al. (1995) Plant Science, 705:219-227 and Meijer et al. (1991) Plant Mol Bio. 

76:807-820); streptomycin (Jones et al (1987) Mol Gen. Genet., 270:86-91); 

spectinomycin (Bretagne-Sagnard et al. (1996) Transgenic Res., 5:131-137); 

bleomycin (Hille etal. (1986) Plant Mol. Biol, 7:\1\-\16); sulfonamide 
25 (Guerineau et al. (1990) Plant Mol. Bio., 75:127-136); bromoxynil (Stalker et al. 

(1988) Science, 242:419-423); 2,4-D (Streber et al. (1989) Bio/Technology, 7:811- 

816); glyphosate (Shaw etal (1986) Science, 233:478-481); phosphinothricin 

(DeBlock et al. (1987) EMBO J., 6:2513-2518). 

See generally, G. T. Yarranton (1992) Curr. Opin. Biotech., 3:506-511; 
30 Christopherson et al. (1992) Proc. Natl Acad. Sci. USA, 59:6314-6318; Yao et al 

(1992) Cell, 77:63-72; W. S. Reznikoff (1992) Mol. Microbiol, 6:2419-2422; 
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Barkley et al. (1980) The Operon, pp. 177-220; Hu et al (1987) Cell, 45:555-566; 
Brown £tf al. (1987) CWZ, 49:603-612; Figge et al. (1988) Cte/Z, 52:713-722; 
Deuschle et al. (1989) Proc. Natl Acad. Aci. USA, 56:5400-5404; Fuerst et al 
(1989) Proc. Natl Acad. Sci. USA, 56:2549-2553; Deuschle et al (1990) Science, 
5 245:480-483; M. Gossen (1993) PhD Thesis, University of Heidelberg; Reines et 

al (1993) Proc. Natl. Acad. Sci. USA, 90:1917-1921; Labow et al. (1990) Mol. 
Cell Bio., 70:3343-3356; Zambretti et al (1992) Proc. Natl Acad. Sci USA, 
59:3952-3956; Bairn et al (1991) Proc. Natl Acad. Sci USA, 55:5072-5076; 
Wyborski et al (1991) Nuc. Acids Res., 79:4647-4653; A. Hillenand-Wissman 

10 (1989) Topics in Mol and Struc. Biol, 70:143-162; Degenkolb et al. (1991) 

Antimicrob. Agents Chemother., 55:1591-1595; Kleinschnidt et al (1988) 
Biochemistry, 27:1094-1104; Gatz et al (1992) Plant J., 2:397-404; A. L. Bonin 
(1993) PhD Thesis, University of Heidelberg; Gossen et al. (1992) Proc. Natl 
Acad. Sci. USA, 59:5547-5551; Olivae/a/. (1992) Antimicrob. Agents Chemother., 

15 36:913-919; Hlavka et al (1985) Handbook of Exp. Pharmacology, 78\ Gill et al 

(1988) Nature 334:721-724. Such disclosures are herein incorporated by reference. 

The above list of selectable marker and reporter genes are not meant to be 
limiting. Any reporter or selectable marker gene are encompassed by the present 
20 invention. If necessary, such genes can be sequenced by methods known in the art. 

The reporter and selectable marker genes are synthesized for optimal 
expression in the plant. That is, the coding sequence of the gene has been modified 
to enhance expression in plants. The synthetic marker gene is designed to be 
25 expressed in plants at a higher level resulting in higher transformation efficiency. 

Methods for synthetic optimization of genes are available in the art. In fact, 
several genes have been optimized to increase expression of the gene product in 
plants. However, until the present invention no one had recognized that 
transformation efficiency could be improved by genetic modification of the marker 
30 gene for optimal expression in the cell being transformed. 
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The marker gene sequence can be optimized for expression in a particular 
plant species or alternatively can be modified for optimal expression in plant 
families. The plant preferred codons may be determined from the codons of highest 
frequency in the proteins expressed in the largest amount in the particular plant 
5 species of interest. See, for example, EPA 0359472; EPA 0385962; WO 91/16432; 

Perlak et al. (1991) Proc. Natl Acad. Sci. USA, 55:3324-3328; and Murray et al. 
(1989) Nucleic Acids Research, 17: 477-498. U.S. Patent No. 5,380,831; U.S. 
Patent No. 5,436,391; and the like, herein incorporated by reference. In this 
manner, the nucleotide sequences can be optimized for expression in any plant. It 
10 is recognized that all or any part of the gene sequence may be optimized or 

synthetic. That is, fully optimized or partially optimized sequences may also be 
used. 

In the same manner, genes can be optimized for expression in any organism. 
Thus, while the invention is described in relation to improving the transformation 

15 efficiency in plants, the methods of the invention can be applied to improving the 

transformation efficiency in any system. 

The marker genes of the invention are provided in expression cassettes for 
expression in the organism of interest. In this manner, the cassette will include 5' 
and 3 ! regulatory sequences operably linked to the gene of interest. Additionally, 

20 the expression cassette may be linked at the 5' end to various promoters from the 

same or different organisms. These promoters would be selected for strength 
and/or inducibility. Examples of such promoters include but are not limited to the 
ubiquitin-1 (Ubi-1) promoter or the cauliflower mosaic virus 35S (CaMv) promoter. 
See for example Christensen et al. (1992) Plant Mol. Biol. 18:675-689; Cornijo et 

25 al. . (1993) Plant Mol. Biol. 23:567-581; Hohn et al. (1993) PNAS 93(16):8334- 

8339. The expression cassette may also include 3' terminator regions linked to the 
gene of interest examples of which are the CaMV 35S terminator and the potato 
proteinase inhibitor protein or pin II terminator. See for example, Mitsuhara et al. 
(1996) Plant Cell Physiol. 37(l):49-59; Seymour et al. (1993) Plant Mol. Biol. 

30 23(1): 1-9; The cassette may additionally contain at least one gene to be 

cotransformed into the organism. Alternatively, the additional gene(s) of interest 
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can be provided on another expression cassette. Where appropriate, the additional 
gene(s) of interest may be optimized for increased expression in the transformed 
plant. 

The expression cassettes may additionally contain 5' leader sequences in the 
5 expression cassette construct. Such leader sequences can act to enhance translation. 

Translation leaders are known in the art and include: picornavirus leaders, for 
example, EMCV leader (Encephalomyocarditis 5' noncoding region) (Elroy-Stein, 
O., Fuerst, T.R., and Moss, B. (1989) PNAS USA, 56:6126-6130); potyvirus 
leaders, for example, TEV leader (Tobacco Etch Virus) (Allison et al (1986); 

10 MDMV leader (Maize Dwarf Mosaic Virus); Virology, 154:9-20), and human 

immunoglobulin heavy-chain binding protein (BiP), (Macejak, D.G., and P. Sarnow 
(1991) Nature, 553:90-94; untranslated leader from the coat protein mRNA of 
alfalfa mosaic virus (AMV RNA 4), (Jobling, S.A., and Gehrke, L., (1987) 
Nature, 325:622-625; tobacco mosaic virus leader (TMV), (Gallie, D.R. et al 

15 (1989) Molecular Biology of RNA, pages 237-256; and maize chlorotic mottle virus 

leader (MCMV) (Lommel, S.A. et al (1991) Virology, 87:382-385). See also, 
Della-Cioppa et al (1987) Plant Physiology, 84:965-968. Other methods known to 
enhance translation can also be utilized, for example, introns, and the like. 

In preparing the expression cassette, the various DNA fragments may be 

20 manipulated, so as to provide for the DNA sequences in the proper orientation and, 

as appropriate in the proper reading frame. Towards this end, adapters or linkers 
may be employed to join the DNA fragments or other manipulations may be 
involved to provide for convenient restriction sites, removal of superfluous DNA, 
removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, 

25 primer repair, restriction, annealing, resection, ligation, PCR, or the like may be 

employed, where insertions, deletions or substitutions, e.g. transitions and 
trans versions, may be involved. 

The compositions and methods of the present invention can be used in any 
transformation protocol. Such transformation protocols may vary depending on the 

30 type of plant or plant cell, i.e. monocot or dicot, targeted for transformation. 

Suitable methods of transforming plant cells include microinjection (Crossway et al 
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(1986) Biotechniques 4:320-334), electroporation (Riggs et al (1986) Proc. Natl. 
Acad. Sci. USA, 55:5602-5606, Agrobacterium mediated transformation (Hinchee et 
al. (1988) Biotechnology, 6:915-921), direct gene transfer (Paszkowski et al. 
(1984) EMBOJ., 5:2717-2722), and ballistic particle acceleration (see, for 
5 example, Sanford et al, U.S. Patent 4,945,050; WO91/10725 and McCabe et al 

(1988) Biotechnology, 6:923-926). Also see, Weissinger et al. (1988) Annual Rev. 
Genet., 22:421-477; Sanford et al. (1987) Particulate Science and Technology, 
5:27-37 (onion); Christou et al (1988) Plant Physiol 57:671-674(soybean); 
McCabe et al (1988) Bio/Technology, 6:923-926 (soybean); Datta et al (1990) 

10 Biotechnology, 5:736-740(rice); Kleins al (1988) Proc. Natl. Acad. Sci. USA, 

55:4305-4309 (maize); Klein et al (1988) Biotechnology, 6:559-563 (maize); 
WO91/10725 (maize); Kleins al (1988) Plant Physiol, 97:440-444(maize); 
Fromm et al (1990) Biotechnology, 5:833-839; and Gordon-Kamm et al. (1990) 
Plant Cell, 2:603-618 (maize); Hooydaas-Van Slogteren & Hooykaas (1984) Nature 

15 (London), 577:763-764; Bytebier et al. (1987) Proc. Natl Acad. Sci. USA, 

54:5345-5349 (Liliaceae); De Wet et al (1985) In The Experimental Manipulation 
of Ovule Tissues, ed. G.P. Chapman et al, pp. 197-209. Longman, NY (pollen); 
Kaeppler et al. (1990) Plant Cell Reports, 9:415-418; and Kaeppler et al (1992) 
Theor. Appl. Genet., 54:560-566 (whisker-mediated transformation); D'Halluin^r 

20 al (1992) Plant Cell, 4:1495-1505 (electroporation); Li et al (1993) Plant Cell 

Reports, 72:250-255 and Christou and Ford (1995) Annals of Botany, 75:407-413 
(rice); Osjoda et al. (1996) Nature Biotechnology, 74:745-750 (maize via 
Agrobacterium tumefaciens)\ all of which are herein incorporated by reference. 

The plant plastid can also be transformed directly. Stable transformation of 

25 plastids have been reported in higher plants. See, for example, Svab et al (1990) 

Proc. Nat'l. Acad. Sci. USA, 57:8526-8530; Svab & Maliga (1993) Proc. Nat'l 
Acad. Sci. USA, 90:913-917; Svab & Maliga (1993) EMBOJ., 72:601-606. The 
method relies on particular gun delivery of DNA containing a selectable marker and 
targeting of the DNA to the plastid genome through homologous recombination. 

30 Additionally, plastid transformation can be accomplished by trans-activation of a 

silent plastid-borne transgene by tissue-specific expression of a nuclear-encoded and 
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plastid-directed RNA polymerase. Such a system has been reported in McBride et 
al. (1994) Proc. Natl. Acad. ScL, USA, 97:7301-7305. Where the transformation 
protocol is directed to plastid transformation, the marker genes are optimized for 
expression in the plant plastid. 
5 The cells which have been transformed may be grown into plants in 

accordance with conventional ways. See, for example, McCormick et al. (1986) 
Plant Cell Reports, 5:81-84. These plants may then be grown, and either pollinated 
with the same transformed strain or different strains, and the resulting hybrid 
having the desired phenotypic characteristic identified. Two or more generations 

10 may be grown to ensure that the subject phenotypic characteristic is stably 

maintained and inherited and then seeds harvested to ensure the desired phenotype 
or other property has been achieved. 

While the present method has broad applicability, it is particularly useful in 
transforming plants which have been recalcitrant to known transformation methods. 

15 That is using the present method, maize elite lines, inbreds, and other lines difficult 

to transform can be transformed directly. 

The following examples are offered by way of illustration and not by way of 
limitation. 

20 EXPERIMENTAL 

EXAMPLE I. GENERATION OF A MONOCOT-OPTIMIZED PAT GENE 
The PAT gene, which confers resistance to the herbicide glufosinate 

ammonium, was originally cloned from Streptomyces viridochromogenes . The 

plasmid pB2/35SAcK consists of a synthetic plant-optimized PAT gene fused to a 
25 35S-promoter and terminator cloned into pUC19. A second construct consisting of 

the PAT gene fused to a plant ubiquitin promoter was also utilized. 

The PAT gene was first modified for expression in plants by replacing the 

GTG codon with ATG, and by modifying the four nucleotides upstream of the ATG 

codon to generate a plant-optimized PAT gene. The present invention relates to the 
30 further modification of the PAT gene to generate an example of a monocot- 

optimized selectable marker gene, the "monocot-optimized" PAT gene (moPAT). 



WO 98/30701 PCT/US98/00359 

-12- 

Analysis of the success of monocot-optimization was determined by recovery of 
transformants when the monocot-optimized PAT gene was used as a selectable 
marker, resulting in the isolation of a large number of highly herbicide-resistant 
events. 

5 In order to generate a monocot-optimized form of the PAT gene, the 

preferred codon usage patterns for maize were examined. See, for example, 
Adang, U.S. Patent No. 5,380,831. Information regarding the preferred codon 
usage of maize allowed for the replacement of codons with those codons that were 
more frequently used in maize. Codons were altered without altering the amino 
10 acid sequence of the PAT polypeptide. A codon usage table that reflects the codon 

usage of the monocot T^ea mays was utilized to optimize the PAT gene expression in 
monocots, particularly maize. Because the codon usage among monocots is similar, 
the genes can be used in any monocot, for example, wheat. It is further recognized 
that monocot optimized sequences may express in acceptable levels in dicots. 
15 The plant-optimized sequence of the PAT gene was translated and compared 

to the native protein sequence. The protein sequence was then back-translated to 
nucleotide sequence using the above-described maize codon usage table. 
Modifications of the nucleotide sequence were not made if such modification would 
result in alteration of the amino acid sequence of the encoded PAT protein. The 
20 basic methodology utilized to generate the monocot-optimized PAT gene sequence 

(moPAT) is outlined below: 

a. The PAT protein amino acid sequence was "back-translated" to 

obtain a nucleotide sequence having those codons most frequently 
used in maize. A nucleotide sequence having codons reflecting 
25 preferred codon usage patterns of maize and encoding the PAT 

protein was determined. The amino acid sequence of the protein 
encoded by the back-translated, modified "monocot-optimized" 
nucleotide sequence was identical to the amino acid sequence 
encoded by the native PAT nucleotide sequence. 
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b. The nucleotide sequence was further modified by removal of regions 
of the gene including potential RNA processing sites, degradation 
sequences, and premature polyadenylation sequences. 

c. Codons used <5% were avoided where possible. 

d. The nucleotide sequence was further modified to delete clusters of 
A/T nucleotide pairs and G/C clusters having more than 10 G/C 
nucleotide pairs where possible. The GC content for maize genes is 
preferably 60-65% of the total nucleotide sequence. 

e. Regions predicted to develop hairpin structures having a free energy 
of -12 kcal/mol were eliminated. 

f . Cloning sites comprising a restriction enzyme recognition sequence 
may be added or removed. 

g. Translation initiation sequence, based on a consensus from highly 
expressing maize genes (ACACGACACCatg), was added. 

Other factors, such as those that influence transcriptional or translational 
initiation sizes, secondary structure of the gene or transcript, or result in 
modification of the poly(A) tail of the mRNA were additionally considered. A 
synthetic gene was then synthesized which incorporates such alterations and is 
shown in Figure 1 . 

EXAMPLE II. UTILIZATION OF THE moPAT GENE FOR INCREASED 
TRANSFORMATION EFFICIENCY 

To confirm that the monocot-optimized gene was expressed in maize and 
produced a gene product in maize tissues, expression vectors containing the moPAT 
gene were transformed into maize cells or tissues. Several plasmids were generated 
in order to test the ability of the synthetic moPAT gene to function in a monocot- 
optimized fashion. Two separate methods were utilized in order to determine the 
efficiency and level of gene expression in monocots transformed with the monocot- 
optimized gene constructs. One method includes transformation of monocot cells 
with the monocot-optimized gene constructs followed by exposure of the cells to an 
herbicide (such as Bialaphos®, Basta, or glufosinate ammonium) to which the PAT 
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gene is known to confer resistance. The number of colonies recovered following 
exposure to the herbicide is an indication of the ability of the monocot-optimized 
gene to function in maize tissues and cells. If transformation is benefited, a greater 
number of transformed events (herbicide-resistant colonies) would be recovered 
following transformation with the moPAT gene than following transformation with 
the PAT gene. 

The data indicated that transformation of cells (comprising model or elite 
maize genotypes) with the 18092 (Ubi::moPAT) plasmid followed by selection on 
Bialaphos resulted in the generation of resistant calli. Such resistant colonies 
appear at an earlier time point and grow at a faster rate than those cells transformed 
with the PAT gene construct 16609. Furthermore, the data indicated that 
transformation of maize cells with the 18092 plasmid comprising the moPAT gene 
resulted in the isolation of a greater number of transformed events than 
transformation with the 16609 plasmid comprising the PAT gene (Tables 1 and 2). 



RESULTS OF TRANSFORMATIONS DONE WITH moPAT: 

Table 1 



Experiment 16609 clones 18092 (moPaf) clones 

1 12 59 

2 8 56 

3 7 1 

4 6 6 

5 5 3 

6 0 1 

7 0 2 



Total: 38 128 



Table 2 



Experiment Construct # Plates Shot # Resistant Calli Frequency 

1 6609 42 800 19.0% 

2 6609 42 1006 23.4% 

3 8092 36 1140 31.7% 
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4 8092 40 1116 27.9% 

A second approach that was utilized to determine the ability of the moPAT 
5 gene to direct expression of the moPAT gene product in maize was detection of the 

PAT gene product in extracts of maize tissues. A large number of events 
transformed with 18092 (Ubi::moPAT) were regenerated. Table 3 gives the results 
of these experiments using monocots derived from transformation with either the 
16609 plasmid comprising the PAT gene or the 18092 plasmid comprising the 

10 moPAT gene (ELISA values reported at pg//*g protein). The data indicates that the 

PAT gene product is detected at greater than 200 pg//xg total soluble protein (tsp) in 
a larger proportion of events transformed with the moPAT gene as compared to 
events transformed with the PAT gene. The data further indicated that 
transformation with the moPAT gene results in the recovery of a greater number of 

15 herbicide-resistant transformed events than recovery following transformation with 

the PAT gene. 

moPAT ELISA RESULTS 
Table 3 

20 

ff Events 

Construct Analyzed Negative 1^50 51-100 > 100 

16609 46 20 12 6 8 

25 18092 84 38 5 6 35 



EXAMPLE III. UTILIZATION OF A MONOCOT-OPTIMIZED GENE FOR 
30 INCREASED RECOVERY OF EVENTS TRANSFORMED WITH A NON- 

SELECTABLE GENE OF INTEREST 

There is a need in the art to produce large numbers of transgenic events 

when developing transgenic crops. As demonstrated in the above-described 

example (Example II), a monocot-optimized gene may be utilized to increase 

35 recovery of transformed events following selection with a drug to which the 

monocot-optimized gene confers resistance. An important obstacle encountered by 
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many skilled in the art is the inability to simply and accurately select for certain 
genes of interest. The moPAT gene functions in such a capacity, in that 
transformation of the moPAT gene with a gene of interest encoding a non-selectable 
gene product allows for selection of transformed events by selection of the 
5 transformed events in the present of a drug. 

The gene encoding a nonselectable gene product may be encoded on the 
same plasmid comprising the moPAT gene or may be comprised within a separate 
plasmid or DNA molecule. If the gene of interest is provided on a separate 
plasmid, then it is likely, although not absolute, that the herbicide-resistant 

10 transformed events have been co-transfected with the non-selectable gene of 

interest, such as the crylA(b) gene. Similarly, if the gene of interest resides on the 
same plasmid comprising the moPAT gene, then it is likely that herbicide-resistant 
events will comprise the gene of interest in addition to the moPAT gene. 
Therefore, by selection of transformed events using a drug to which the moPAT 

15 gene confers resistance, the probability of isolating a transformed event expressing 

the gene product of the gene of interest is increased. This is extremely important 
when producing transgenic crops in that large numbers of transgenic events must be 
isolated. Thus, the optimized gene enhances the ability to recover transformed 
events following transformation with a non-selectable gene of interest. 

20 

EXAMPLE IV. TRANSGENIC MAIZE COMPRISING A MONOCOT- 
OPTIMIZED moPAT GENE 

To provide a maize plant comprising a monocot-optimized gene, a 

transgenic maize plant is generated by transformation of a monocot-optimized gene 

25 into a maize regenerable tissue followed by regeneration of said regenerable tissue 

into a mature trangenic maize plant. The maize regenerable tissue is transformed 
with an expression vector comprising a monocot-optimized gene. Following 
regeneration of the mature maize plant, tissues of the transgenic plant are harvested 
and assayed for the presence of the monocot-optimized gene. 

30 Monocot cells were transformed by methods known in the art. See, for 

example, Klein et al. (1988) Proc. Natl Acad. Sci. USA, 85:4305-4309 and Klein 



WO 98/30701 



PCT/US98/00359 



-17- 

et al (1989) Proc. Natl Acad, Sci. USA, 56:6681-6685, herein incorporated by 
reference. Generally, GS3 callus lines were the target issue. GS3 is a high-type II 
(model) corn genotype. Generally, the callus was sieved in perparation for 
bombardment. 

5 After being bombed with particles coated with DNA at a concentration of 

about 0. 1/xg DN A/shot/plate, the tissue was maintained for two days on medium 
with no selection agent, afterwhich the tissue was transferred to medium with a 
selection agent (3/xg/liter bialophos) to initiate the plant regeneration process. 
Expression of the moPAT gene and its gene product confers a selective 
10 advantage to the transgenic plant. Thus, a transgenic plant is generated that has a 

selective advantage (herbicide resistance) over a non-transgenic plant. 

EXAMPLE V. COMPARISON OF TRANSFORMATION EFFICIENCIES IN 
TRANSGENIC WHEAT (PAT vs moPATV 
15 PAT and moPAT were transformed into the cultivar Bob white (wheat) in 

order to determine whether the monocot-optimized C AH sequence would lead to 

improved transformation efficiencies in other plant species besides maize. Two 

experiments were conducted comparing the transformation efficiencies of the PAT 

gene with moPAT. Heads were harvested 12-14 days post anthesis, seeds sterilized 

20 in 20% sodium hypochlorite for 30 min., and rinsed three times in sterile water. 

Immature embryos were excised and plated on MS salts, 2% sucrose, 150 mg/1 
asparagine, 0.5 mg/1 thiamine HC1, 1.5 mg/1 2,4-D, pH 5.8, solidified with 2.5 g/1 
Gelrite (initiation medium). Plates were incubated in the dark at 26°C. Embryos were 
transferred, five days post excision, to the above medium supplemented with 0.4M 

25 mannitol and cultured for four hours, then bombarded with 1 micron gold particles 

(0.083 ug DNA, 650 psi). 

Twenty hours post-bombardment, embryos were transferred from the high 
osmotic medium to initiation medium containing 3 mg/1 Bialaphos and cultured in 
the dark (16/8 photo-period). The embryos were subcultured approximately every 

30 2 weeks for 4 months. Resistant calli were placed initially on regeneration media 

(MS salts and vitamins, 2% sucrose, 0.5 mg/1 Dicamba, 3 mg/1 Bialaphos, 2.5 mg/1 
Gelrite), and upon shoot formation, transferred to the same medium (minus 
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Dicamba) containing 5 mg/1 Zeatin and transferred to light. Shoots were rooted in 
MS salts and vitamins at concentrations of 0.1 g/1 myo-inositol, 4% sucrose, 0.7 
mg/1 IB A, 0.3 mg/1 NAA, 1.5 g/1 Gelrite. The transformation efficiency for the 
moPAT construct was triple that of PAT. The transformation efficiency for PAT 
5 was 0.3% and for moPAT was 0.9%, as is shown in Table 4. 



PAT vs. moPAT TRANSFORMATION EFFICIENCY 

Table 4 











# PCR + 










#PCR + 


Events 


Trans. 


Gene 




Embryos 


Events 


with seed 


Efficiency 


PAT 


6609 


625 


4 


2 


0.3% 


moPAT 


8092 


568 


6 


5 


0.9% 



EXAMPLE VI. GENERATION OF A MONOCOT-OPTIMIZED CAH 
SEQUENCE 

20 Cyanamide in aqueous solution or in the form of its calcium salt is used as a 

fertilizer in agriculture. It also can act as an effective herbicide if applied prior to 
sowing. The enzyme cyanamide hydratase hydrates the nitrile group of cyanamide 
to form urea. 

Cyanamide hydratase has been purified from Myrothecium verrucaria. See, 
25 Maier-Greiner et al (1991) Proc. Natl Acad. Sci. USA, £5:4260-4264, herein 

incorporated by reference. The gene encoding cyanamide hydratase was optimized 
for expression in maize by the methods outlines above. Codons were altered 
without altering the amino acid sequence of the enzyme. The nucleic acid and 
amino acid sequences of the optimized gene are given in Figure 2. . 



30 
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EXAMPLE VII. ANALYSIS OF Tl PROGENY OF moCAH TRANSGENIC 
EVENTS: 

To characterize the stability of the moCAH selectable marker from parent to 
5 progeny, maize cells were transformed with moCAH and successive generations were 

assayed for the presence of the moCAH gene. Three transgenic events were produced 
using the monocot-optimized cyanamide hydratase gene (moCAH) as the selectable 
marker. These events were confirmed at TO plant level by Southern blot analysis. In 
order to confirm transgene inheritance in these events Tl plants were analyzed for the 

10 presence of the transgene using PCR and herbicide leaf painting techniques. Seeds 

were planted in soil in the greenhouse and plants were sampled at V4-5 leaf stage. In 
the first transgenic event thirty-nine Tl plants from one TO plant were analyzed by 
PCR of which twenty five Tl plants were positive for the moCAH transgene. In the 
second transgenic event, two TO plants were used. Fifty nine Tl plants from the first 

15 TO were analyzed by PCR, of which thirty two were positive for the moCAH 

transgene. Out of fourteen Tl plants from the second TO plant, eight Tl plants were 
positive for the moCAH transgene. None of the seventy-one Tl seeds germinated in 
the third transgenic event due to poor seed quality and mold problems. 

To find out whether the inherited moCAH gene was still functional, Tl plants 

20 at V5-6 stage were leaf-painted with a 10% solution of the commercial herbicide 

Dormex (containing 50% cyanamide). All control non- transformed plants were 
susceptible to the herbicide with the painted leaf showing severe damage. Some 
transgenic Tl plants from both transgenic events were completely resistant to 50% 
cyanamide demonstrating no damage from leaf painting. In Event #2, PCR analysis 

25 showing the presence of the moCAH transgene and leaf painting results 

demonstrating improved resistance to cyanamide displayed a segregation pattern of 
approximately 1:1. These results clearly demonstrate that the moC AH gene can be 
used efficiently as a selectable marker to transform maize plants and that the 
introduced moCAH gene can be stably integrated into the maize genome and 

30 transmitted to the following generation. The DNA construct that contained the 

moCAH gene was arranged on the integrating vector 10675 in the order Ubi 
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promoter;; moCAH gene;; Pinll terminator. Sequences of the primers used for PCR 
confirmation of moCAH presence in the plant genome are given below. 

primer 1: CTACAACCACTCCATGCGCGTGTTC 

5 

primer 2: CACATAACACACAACTTTGATGCCCAC 



EXAMPLE VIII. COMPARISON OF TRANSFORMATION EFFICIENCIES TN 
10 TRANSGENIC WHEAT fCAH VS. moCAH). 

To test whether the use of moCAH improved the transformation efficiency in 

other species, both CAH and moCAH were transformed into the wheat cultivar 

(Bobwhite). Four experiments were conducted to test CAH against moCAH 

transformation efficiency. Media used were as described above except that 37.5 mg/1 

15 cyanamide was substituted for Bialaphos.. No selection agent was used in the 

regeneration and rooting stages. The results showed a 0.0% transformation efficiency 

for CAH and a 0.8% transformation efficiency for moCAH. 

CAH vs. mo CAH TRANSFORMATION EFFICIENCY 
20 Table 5 



#PCR+ 

# #PCR+ Events Trans. 

Gene Dp# Embryos Events with seed Efficiency 

25 

CAH 10660 652 0 0 0% 

moCAH 10675 653 5 5 0.8% 



30 All publications and patent applications mentioned in the specification are 

indicative of the level of those skilled in the art to which this invention pertains. 
All publications and patent applications are herein incorporated by reference to the 
same extent as if each individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 

35 Although the foregoing invention has been described in some detail by way 

of illustration and example for purposes of clarity of understanding, it will be 
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obvious that certain changes and modifications may be practiced within the scope of 
the appended claims. 



WO 98/30701 



PCT/US98/00359 



-22- 

IN THE CLAIMS 

What is claimed is: 

1 . A method for transforming a plant, said method comprising 
ransforming a cell from said plant with a marker gene wherein said marker gene has 
been optimized for expression; and regenerating said cell into a transformed 
plant. 

2. The method of claim 1, wherein said method comprises 
transforming said cell with at least on additional gene. 

3 . The method of claim 1 , wherein said transformation is nuclear 
transformation. 

4. The method of claim 1, wherein said transformation is chloroplast 
transformation. 

5. The method of claim 1, wherein said plant is a monocot. 

6. The method of claim 5, wherein said monocot is maize. 

7. The method of claim 1, wherein said plant is a dicot. 

8. The method of claim 1, wherein said marker gene is a selectable 
marker gene. 

9. The method of claim 8, wherein said selectable marker gene is 
selected from a gene encoding antibiotic resistance or a gene encoding resistance to 
a herbicide. 
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10. The method of claim 9, wherein said gene encodes phosphinothricin 
acety ltransf erase . 

11. The method of claim 9, wherein said gene encodes cyanamide 
hydratase. 

12. The method of claim 1, wherein said marker gene is a reporter 

gene. 

13. The method of claim 12, wherein said reporter gene is green 
fluorescent protein. 

14. A method for increasing transformation efficiency, said method 
comprising transforming a cell with a marker gene wherein said marker gene has 
been optimized for expression in said cell; and regenerating said transformed cells 
into a transformed organism. 

15. The method of claim 14, wherein said organism is a plant. 

16. The method of claim 15, wherein said plant is a monocot. 

17. The method of claim 16, wherein said monocot is maize. 

18. The method of claim 14, wherein said plant is a dicot. 

19. The method of claim 12, wherein said transformation efficiency is 
increased at least two fold. 

20. The method of claim 14, wherein said transformation efficiency is 
increased at least five fold. 
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21. The method of claim 14, wherein said transformation efficiency is 
increased at least ten fold. 

22. The method of claim 14, wherein said method comprises 
transforming said cell with at least one additional gene. 

23 . A marker gene which has been modified to optimize expression in a 

plant. 

24. The gene of claim 23, wherein said gene is a selectable marker gene. 

25. The gene of claim 24, wherein said gene is selected from a gene 
encoding antibiotic resistance or a gene encoding resistance to a herbicide. 

26. The gene of claim 25, wherein said gene encodes phosphinothricin 
acetyltransferase. 

27. The gene of claim 26, wherein said gene encodes cyanamide 
hydratase. 

28. The gene of claim 23, wherein said marker gene is a reporter gene. 

29. The gene of claim 28, wherein said marker gene is green 
fluorescent protein. 

30. A plant which has been transformed with the gene of claim 20. 

31 . Transformed seed from the plant of claim 30. 

32. A plant which has been transformed with the gene of claim 27. 
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33. Transformed seed from the plant of claim 32. 

34. A plant which has been transformed with the gene of claim 29. 

35. Transformed seed from the plant of claim 34. 

36. The gene of claim 26, wherein said gene has the nucleotide 
sequence given in Figure 1 . 

37. The gene of claim 27, wherein said gene has the nucleotide 
sequence given in Figure 2. 

38. A plant having stably incorporated into its genome a marker gene, 
wherein said marker gene has been optimized for expression. 

39. The plant of claim 38, wherein said plant comprises at least one 
additional chimeric gene incorporated into its genome. 

40. Seed of the plant of claim 38. 

41 . Seed of the plant of claim 39. 
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FIGURE 1 



MoPAT Sequence and translation: 

1 GGATCCACAC GACACC ATG T CCCCCGAGCG CCGCCCCGTC GAGATCCGCC CGGCCACCGC 
61 CGCCGACATG GCCGCCGTGT GCGACATCGT GAACCACTAC ATCGAGACCT CCACCGTGAA 
121 CTTCCGCACC GAGCCGCAGA CCCCGCAGGA GTGGATCGAC GACCTGGAGC GCCTCCAGGA 
181 CCGCTACCCG TGGCTCGTGG CCGAGGTGGA GGGCGTGGTG GCCGGCATCG CCTACGCCGG 
241 CCCGTGGAAG GCCCGCAACG CCTACGACTG GACCGTGGAG TCCACCGTGT ACGTGTCCCA 
301 CCGCCACCAG CGCCTCGGCC TCGGCTCCAC CCTCTACACC CACCTCCTCA AGAGCATGGA 

3 61 GGCCCAGGGC TTCAAGTCCG TGGTGGCCGT GATCGGCCTC CCGAACGACC CGTCCGTGCG 
421 CCTCCACGAG GCCCTCGGCT ACACCGCCCG CGGCACCCTC CGCGCCGCCG GCTACAAGCA 

4 81 CGGCGGCTGG CACGACGTCG GCTTCTGGCA GCGCGACTTC GAGCTGCCGG CCCCGCCGCG 
541 CCCGGTGCGC CCGGTGACGC AGATC TGA GG TGTCGTGTTA AC 



+1 MetSerProGluArgArgProValGluIleArgProAlaThrAlaAlaAspMetAlaAla 
1 ATGTCCCCCGAGCGCCGCCCCGTCGAGATCCGCCCGGCCACCGCCGCCGACATGGCCGCC 
TACAGGGGGCTCGCGGCGGGGCAGCTCTAGGCGGGCCGGTGGCGGCGGCTGTACCGGCGG 



+1 ValCysAspIleValAsnHisTyrlleGluThrSerThrValAsnPheArgThrGluPro 
61 GTGTGCGACATCGTGAACCACTACATCGAGACCTCCACCGTGAACTTCCGCACCGAGCCG 
CACACGCTGTAGCACTTGGTGATGTAGCTCTGGAGGTGGCACTTGAAGGCGTGGCTCGGC 



+1 GlnThrProGlnGluTrpIleAspAspLeuGluArgLeuGlnAspArgTyrProTrpLeu 
121 CAGACCCCGCAGGAGTGGATCGACGACCTGGAGCGCCTCCAGGACCGCTACCCGTGGCTC 
GTCTGGGGCGTCCTCACCTAGCTGCTGGACCTCGCGGAGGTCCTGGCGATGGGCACCGAG 



+1 ValAlaGluValGluGlyValValAlaGlylleAlaTyrAlaGlvProTrpLysAlaArg 
1 GTGGCCGAGGTGGAGGGCGTGGTGGCCGGCATCGCCTACGCCGGCCCGTGGAAGGCCCGC 
CACCGGCTCCACCTCCCGCACCACCGGCCGTAGCGGATGCGGCCGGGCACCTTCCGGGCG 



+1 AsnAlaTyrAspTrpThrValGluSerThrValTyrValSerHisArgHisGlnArgLeu 
2 41 AACGCCTACGACTGGACCGTGGAGTCCACCGTGTACGTGTCCCACCGCCACCAGCGCCTC 
TTGCGGATGCTGACCTGGCACCTCAGGTGGCACATGCACAGGGTGGCGGTGGTCGCGGAG 



+1 GlyLeuGlySerThrLeuTyrThrHisLeuLeuLysSerMetGluAlaGlnGlyPheLys 
301 GGCCTCGGCTCCACCCTCTACACCCACCTCCTCAAGAGCATGGAGGCCCAGGGCTTCAAG 
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FIGURE 1 (cont.) 



CCGGAGCCGAGGTGGGAGATGTGGGTGGAGGAGTTCTCGTACCTCCGGGTCCCGAAGTTC 

+1 SerValValAlaVallleGlyLeuProAsnAspProSerValArgLeuHisGluAlaLeu 

3 61 TCCGTGGTGGCCGTGATCGGCCTCCCGAACGACCCGTCCGTGCGCCTCCACGAGGCCCTC 

AGGCACCACCGGCACTAGCCGGAGGGCTTGCTGGGCAGGCACGCGGAGGTGCTCCGGGAG 

+1 GlyTyrThrAlaArgGlyThrLeuArgAlaAlaGlyTyrLysHisGlyGlyTrpHisAsp 

4 21 GGCTACACCGCCCGCGGCACCCTCCGCGCCGCCGGCTACAAGCACGGCGGCTGGCACGAC 

CCGATGTGGCGGGCGCCGTGGGAGGCGCGGCGGCCGATGTTCGTGCCGCCGACCGTGCTG 

+1 ValGlyPheTrpGlnArgAspPheGluLeuProAlaProProArgProValArgProVal 
4 81 GTCGGCTTCTGGCAGCGCGACTTCGAGCTGCCGGCCCCGCCGCGCCCGGTGCGCCCGGTG 
CAGCCGAAGACCGTCGCGCTGAAGCTCGACGGCCGGGGCGGCGCGGGCCACGCGGGCCAC 

+1 ThrGlnlle*** 
541 ACGCAGATCTGA 
TGCGTCTAGACT 
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FIGURE 2 



1 TCGCGCGTTT 
AGCGCGCAAA 



CGGTGATGAC 
GCCACTACTG 



GGTGAAAACC 
CCACTTTTGG 



TCTGACACAT 
AGACTGTGTA 



GCAGCTCCCG 
CGTCGAGGGC 



51 GAGACGGTCA 
CTCTGCCAGT 



CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA 
GTCGAACAGA CATTCGCCTA CGGCCCTCGT 



GACAAGCCCG 
CTGTTCGGGC 



101 TCAGGGCGCG 
AGTCCCGCGC 



TCAGCGGGTG 
AGTCGCCCAC 



TTGGCGGGTG 
AACCGCCCAC 



TCGGGGCTGG 
AGCCCCGACC 



CTTAACTATG 
GAATTGATAC 



151 CGGCATCAGA 
GCCGTAGTCT 



GCAGATTGTA 
CGTCTAACAT 



CTGAGAGTGC 
GACTCTCACG 



ACCATATGCG 
TGGTATACGC 



GTGTGAAATA 
CACACTTTAT 



2 01 CCGCACAGAT 
GGCGTGTCTA 



GCGTAAGGAG 
CGCATTCCTC 



AAAATACCGC 
TTTTATGGCG 



ATCAGGCGCC 
TAGTCCGCGG 



ATTCGCCATT 
TAAGCGGTAA 



251 CAGGCTGCGC 
GTCCGACGCG 



AACTGTTGGG 
TTGACAACCC 



AAGGGCGATC 
TTCCCGCTAG 



GGTGCGGGCC 
CCACGCCCGG 



TCTTCGCTAT 
AGAAGCGATA 



301 TACGCCAGCT 
ATGCGGTCGA 

351 ACGCCAGGGT 
TGCGGTGCCA 
Hindi I I 

401 GCTTGCATGC 
CGAACGTACG 



GGCGAAAGGG 
CCGCTTTCCC 

TTTCCCAGTC 
AAAGGGTCAG 

CTGCAGGTCG 
GACGTCCAGC 



GGATGTGCTG 
CCTACACGAC 



ACGACGTTGT 
TGCTGCAACA 



CAAGGCGATT 
GTTCCGCTAA 



AAAACGACGG 
TTTTGCTGCC 



Hpal 



ACTCTAGAGT 
TGAGATCTCA 



TAACACGACA 
ATTGTGCTGT 



AAGTTGGGTA 
TTCAACCCAT 
Hindlll 
CCAGTGCCAA 
GGTCACGGTT 

CCTCACTCCC 
GGAGTGAGGG 



4 51 ACGGCTTCAT 
TGCCGAAGTA 



CAGGGTGTTG 
GTCCCACAAC 



GCCTCCATCT 
CGGAGGTAGA 



GCTTGTCGAA 
CGAACAGCTT 



CTGCGGGATG 
GACGCCCTAC 



501 TGGGTGGTGT 
ACCCACCACA 



GGCACCACGG 
CCGTGGTGCC 



CTTGTTGGAC 
GAACAACCTG 



TCCTCCTTGC 
AGGAGGAACG 



GCACGGTGCA 
CGTGCCACGT 



551 GGCGAACCAG 
CCGCTTGGTC 



GAGCACCAGC 
CTCGTGGTCG 



CGTGGCGCGG 
GCACCGCGCC 



GAAGGCGGTG 
CTTCCGCCAC 



TTGATGGAGT 
AACTACCTCA 



601 TGCGGGTGGT 
ACGCCCACCA 



GTCGTCCACC 
CAGCAGGTGG 



CAGGAGCCGA 
GTCCTCGGCT 



AGTCGTCGAT 
TCAGCAGCTA 



GCCGTCGTAG 
CGGCAGCATC 



651 GCGCCCACGT 
CGCGGGTGCA 



TGTCGTAGAG 
ACAGCATCTC 



GGTGGCGAGC 
CCACCGCTCG 



TGGATGAGCT 
ACCTACTCGA 



GGCCGAGGAA 
CCGGCTCCTT 



7 01 GGTGATGTTG 
CCACTACAAC 



CCGTCGACGC 
GGCAGCTGCG 



CGACGTCCTC 
GCTGCAGGAG 



GTGGCGGATG 
CACCGCCTAC 



ATGGCCTCGG 
TACCGGAGCC 



751 CCACCGCCTC 
GGTGGCGGAG 



CGCCTGGTCG 
GCGGACCAGC 



GTGGAGGAGC 
CACCTCCTCG 



CGAGCACCT 
GCTCGTGGAA 



GAGGACCTCC 
CTCCTGGAGG 



801 ATCGCCTTGA 
TAGCGGAACT 



TGCCGCCGTA 
ACGGCGGCAT 



GATGTCGAAG 
CTACAGCTTC 



GACATGCGGG 
CTGTACGCCC 



TGGAGGTGAA 
ACCTCCACTT 



851 GTAGGCCTCG 
CATCCGGAGC 



GCGGTGCCCA 
CGCCACGGGT 



CGTCGTGGAG 
GCAGCACCTC 



GAGGCAGGTG 
CTCCGTCCAC 



AGGGCCCAGG 
TCCCGGGTCC 
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FIGURE 2 (cont.) 

901 TGGACGGGGA GAGGTCCTTG GCCTGCTCCG GGAGGAGCCT CCTGGCrATP 
ACCTGCCCCT CTCCAGGAAC CGGACGAGGC CCTCCTCGGA GGACCGCTAG 

951 ACGGTGCCCC AGTAGAACAC GCGCATGGAG TGGTTGTAGG TCTCGCrrrA 
TGCCACGGGG TCATCTTGTG CGCGTACCTC ACC^CATCC AGAGGCCCCT 

1001 GAGGCGGGCC TTGACGAACG CCTGGGCCTC GGCCACGAGC TTCTrrrrrr 
CTCCGCCCGG AACTGCTTGC GGACCCGGAG CCGGTGCTCG I£c£gCCGGC 

1051 CTGGGAAGGC GATGTCCTCC ACGGAGTAGG AGCTGACGTC GCCCACCTTr 
GACCCTTCCG CTACAGGAGG TGCCTCATCC TCGACTGCAG CGGCTCGAAC 

1101 CCGAGGGAGT CCACGATGGC CTTGGCGCTG ACGGGGACCG ccrTmrrn 
GGCTCCCTCA GGTGCTACCG GAACCGCGA? ?GCCCCTGGC GCCAGGTCGG 

1151 GTTGGCCTTC ACCTCGGAGG ACGACATGGT GTCGTGT GGA^CC CrrrrTA 

CAACCGGAAG TGGAGCCTCC TGCTGTACCA CAGcicACCT AGGGGCCCAT 
EcoRI 

1201 CCGAATTCGT AATCATGGTC ATAGCTGTTT CCTGTGTGAA ATTPTTATrr 

GGCTTAAGCA TTAGTACCAG TATCGACAAA GGACACACTT TaIcIItJgI 

12 51 GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAArrrT 
CGAGTGTTAA GGTGTGTTGT ATGCTCGGCC TTCGTATTTC ACA^CGGA 

1301 ^GGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 
CCCCACGGAT TAGTCACTCG ATTGAGTGTA ATTAACGCAA CGCGAGTGAC 

1351 CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATrrr 
GGGCGAAAGG TCAGCCCTTT GGACAGCACG GTCGACGTAA ^IotIgCC 

14 01 CCAACGCGCG GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTrrr 
GGTTGCGCGC CCCTCTCCGC CAAACGCATA ACCCG?Ga£a AGGCGAAGGA 

1451 CGCTCACTGA CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG AGCGRTATPA 
GCGAGTGACT GAGCGACGCG AGCCAGCAAG CCGACGCCGC TCGCcItAGT 

1501 GCTCACTCAA AGGCGGTAAT ACGGTTATCC ACAGAATCAG GPPATAArrr 
CGAGTGAGTT TCCGCCATTA TGCCAATAGG TGTCT?A5tc CCCTATTGCG 

1551 AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AAPPPT A A A a 
TCCTTTCTTG TACACTCGTT TTCCGGTCGT mcCGGTCC TTGGCATTTT 

1601 AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGAPPAPPAT 
TCCGGCGCAA CGACCGCAAA AAGGTATCCG AGGCGGGGGG IcTGCTCGtI 

1651 CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA 
GTGTTTTTAG CTGCGAGTTC AGTCTCCACC GCTTTGGGCT GTCCTGATAT 

1701 AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC 
TTCTATGGTC CGCAAAGGGG GACCTTCGAG GGAGCACGCG AGAGGACAAG 

1751 CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTPPPPAAPP 
GCTGGGACGG CGAATGGCCT ATGGACAGGC GGAAAGAGGG £[gcOTTCG 



SUBSTITUTE SHEET (RULE 26) 



WO 98/30701 



PCT/US98/00359 



5/6 

FIGURE 2 (cont.) 



1801 GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT 
CACCGCGAAA GAGTATCGAG TGCGACATCC ATAGAGTCAA GCCACATCCA 



18 51 CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC 
GCAAGCGAGG TTCGACCCGA CACACGTGCT TGGGGGGCAA GTCGGGCTGG 



1901 GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC 

CGACGCGGAA TAGGCCATTG ATAGCAGAAC TCAGGTTGGG CCATTCTGTG 

AlwNl 

1951 GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG 

CTGAATAGCG GTGACCGTCG TCGGTGACCA TTGTCCTAAT CGTCTCGCTC 



2001 GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT 
CATACATCCG CCACGATGTC TCAAGAACTT CACCACCGGA TTGATGCCGA 



2 051 ACACTAGAAG GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC 
TGTGATCTTC CTGTCATAAA CCATAGACGC GAGACGACTT CGGTCAATGG 



2101 TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG 
AAGCCTTTTT CTCAACCATC GAGAACTAGG CCGTTTGTTT GGTGGCGACC 



2151 TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG 
ATCGCCACCA AAAAAACAAA CGTTCGTCGT CTAATGCGCG TCTTTTTTTC 



2201 GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG 
CTAGAGTTCT TCTAGGAAAC TAGAAAAGAT GCCCCAGACT GCGAGTCACC 



22 51 AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT 
TTGCTTTTGA GTGCAATTCC CTAAAACCAG TACTCTAATA GTTTTTCCTA 



2 301 CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA 
GAAGTGGATC TAGGAAAATT TAATTTTTAC TTCAAAATT AGTTAGATTT 



2 351 GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG 
CATATATACT CATTTGAACC AGACTGTCAA TGGTTACGAA TTAGTCACTC 



2 4 01 GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT 
CGTGGATAGA GTCGCTAGAC AGATAAAGCA AGTAGGTATC AACGGACTGA 



2 4 51 CCCCGTCGTG TAGATAACTA CGATACCGGA GGGCTTACCA TCTGGCCCCA 
GGGGCAGCAC ATCTATTGAT GCTATGCCCT CCCGAATGGT AGACCGGGGT 



2 501 GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA 
CACGACGTTA CTATGGCGCT CTGGGTGCGA GTGGCCGAGG TCTAAATAGT 



2 551 GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 
CGTTATTTGG TCGGTCGGCC TTCCCGGCTC GCGTCTTCAC CAGGACGTTG 



2 601 TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA 
AAATAGGGGG AGGTAGGTCA GATAATTAAC AACGGCCCTT CGATCTCATT 



2 651 GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC 
CATCAAGCGG TCAATTATCA AACGCGTTGC AACAACGGTA ACGATGTCCG 
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FIGURE 2 (cont.) 



2701 ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC 
TAGCACCACA GTGCGAGCAG CAAACCATAC CGAAGTAAGT CGAGGCCAAG 



27 51 CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 
GGTTGCTAGT TCCGCTCAAT GTACTAGGGG GTACAACACG TTTTTTCGCC 



28 01 TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG 
AATCGAGGAA GCCAGGAGGC TAGCAACAGT CTTCATTCAA CCGGCGTCAC 



2 851 TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC 
AATAGTGAGT ACCAATACCG TCGTGACGTA TTAAGAGAAT GACAGTACGG 



2 901 ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT 
TAGGCATTCT AC G AAAAG AC ACTGACCACT CATGAGTTGG TTCAGTAAGA 



2 951 GAGAATAGTG TATGCGGCCA CCGAGTTGCT CTTGCCCGGC GTCAATACGG 

CTCTTATCAC ATACGCCGCT GGCTCAACGA GAACGGGCCG CAGTTATGCC 

As p700 

3 001 GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA 

CTATTATGGC GCGGTGTATC GTCTTGAAAT TTTCACGAGT AGTAACCTTT 
Asp700 

3 0 51 ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA 

TGCAAGAAGC CCCGCTTTTG AGAGTTCCTA GAATGGCGAC AACTCTAGGT 



3101 GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT 
CAAGCTACAT TGGGTGAGCA CGTGGGTTGA CTAGAAGTCG T AG AAAAT G A 



3151 TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 
AAGTGGTCGC AAAGACCCAC TCGTTTTTGT CCTTCCGTTT TACGGCGTTT 



32 01 AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT 
TTTCCCTTAT TCCCGCTGTG CCTTTACAAC TTATGAGTAT GAGAAGGAAA 



3251 TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC 
AAGTTATAAT AACTTCGTAA ATAGTCCCAA TAACAGAGTA CTCGCCTATG 



3301 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT 
TATAAACTTA CATAAATCTT TTTATTTGTT TATCCCCAAG GCGCGTGTAA 



3 351 TCCCCGAAAA GTGCCACCTG ACGTCTAAGA AACCATTATT ATCATGACAT 
AGGGGCTTTT CACGGTGGAC TGCAGATTCT TTGGTAATAA TAGTACTGTA 



34 01 TAACCTATAA AAATAGGCGT ATCACGAGGC CCTTTCGTC 
ATTGGATATT TTTATCCGCA TAGTGCTCCG GGAAAGCAG 
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